Duality-free Methods for Stochastic Composition Optimization
نویسندگان
چکیده
We consider the composition optimization with two expected-value functions in the form of 1 n ∑n i=1 Fi( 1 m ∑m j=1Gj(x))+ R(x), which formulates many important problems in statistical learning and machine learning such as solving Bellman equations in reinforcement learning and nonlinear embedding. Full Gradient or classical stochastic gradient descent based optimization algorithms are unsuitable or computationally expensive to solve this problem due to the inner expectation 1 m ∑m j=1Gj(x). We propose a duality-free based stochastic composition method that combines variance reduction methods to address the stochastic composition problem. We apply SVRG and SAGA based methods to estimate the inner function, and duality-free method to estimate the outer function. We prove the linear convergence rate not only for the convex composition problem, but also for the case that the individual outer functions are non-convex while the objective function is stronglyconvex. We also provide the results of experiments that show the effectiveness of our proposed methods.
منابع مشابه
Optimality and duality theory for stochastic optimization problems with nonlinear dominance constraints
We consider a new class of optimization problems involving stochastic dominance constraints of second order. We develop a new splitting approach to these models, optimality conditions and duality theory. These results are used to construct special decomposition methods.
متن کاملBlock-Coordinate Frank-Wolfe for Structural SVMs
We propose a randomized block-coordinate variant of the classic Frank-Wolfe algorithm for convex optimization with block-separable constraints. Despite its lower iteration cost, we show that it achieves the same convergence rate as the full Frank-Wolfe algorithm. We also show that, when applied to the dual structural support vector machine (SVM) objective, this algorithm has the same low iterat...
متن کاملBlock-Coordinate Frank-Wolfe Optimization for Structural SVMs
We propose a randomized block-coordinate variant of the classic Frank-Wolfe algorithm for convex optimization with block-separable constraints. Despite its lower iteration cost, we show that it achieves a similar convergence rate in duality gap as the full FrankWolfe algorithm. We also show that, when applied to the dual structural support vector machine (SVM) objective, this yields an online a...
متن کاملDuality gaps in nonconvex stochastic optimization
We consider multistage stochastic optimization models containing nonconvex constraints, e.g., due to logical or integrality requirements. We study three variants of Lagrangian relaxations and of the corresponding decomposition schemes, namely, scenario, nodal and geographical decomposition. Based on convex equivalents for the Lagrangian duals, we compare the duality gaps for these decomposition...
متن کاملOptimal Power Generation under Uncertainty via Stochastic Programming
A power generation system comprising thermal and pumped storage hy dro plants is considered Two kinds of models for the cost optimal generation of electric power under uncertain load are introduced i a dynamic model for the short term operation and ii a power production planning model In both cases the presence of stochastic data in the optimization model leads to multi stage and two stage stoc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1710.09554 شماره
صفحات -
تاریخ انتشار 2017